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DETECTTON METHOD FOR C-RAF-1 GENES 

5 



Field of the Invention 

1 0 The present invention relates to ( 1 ) a method of 

identifying an individual at an increased risk for 
developing cancer, (2) a method for determining a 
prognosis of patients afflicted with cancer, and (3) a 
method for determining the proper course of treatment for 

15 a patient afflicted with cancer. 

Background Information 

Lung cancer claims more lives in the United 

20 States than any other neoplasm (R.S. Finley, Am. Pharm. 
NS29, 39 (1989)), and of the various forms lung 
adenocarcinomas have one of the worst prognoses (T.P. 
Miller, Sftmin. Oncol. 17, 11 (1990)). The incidence of 
adenocarcinoma of the lung (ACL) in the United States is 

25 also quickly rising (I. Linnoila, Hematol. Oncol. North. 
Am. 4, 1027 (1990); J.B. Sorensen, H.H. Hansen, Cancer 
surviv. 8, 671 (1989)). In order to gain insight into 
this complex and deadly disease, a model system for its 
study has been developed. For such a model to provide 

30 clinically relevant data several criteria must be met. 

The tumors produced should be histologically equivalent to 
their human counterparts, tumor induction must be reliable 
and reproducible, and the numbers generated must be great 
enough to provide statistical significance. To satisfy 

35 these conditions a system has been created which uses two 
inbred mouse strains (NFS/n and AKR) along with 
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transplacental exposure to the potent carcinogen N-ethyl- 
N-nitrosourea (ENU) and promotion with the antioxidant 
butylated hydroxytoluene (BHT) • The resulting tumors were 
examined for altered expression or structural mutations of 
5 genes implicated in lung tumor development such as ras, 
myc, and raf oncogenes (CD. Little et al., Nature 306, 
194 (1983); P.E. Kiefer et al., Cancer Res. , 47, 6236 
(1987); E. Santos et al., Science 223, 661 (1984); S. 
Eodenhuis, N Eagi, .t. Med. 317, 929 (1987); M. Barbacid, 
10 Eur J run, invest. 20, 225 (1990); O.R. Rapp et al., i. 

TnJ- > MM . f ™- Studv " f T "" q CanCer 4 ' 162 (1988); 

M.J. Birrer et al., Ann. Rev. Med. 40, 305 (1989); G. 
Sithanandam et al., Oncogene 4, 451 (1989)). 

15 raf proto-oncogenes are evolutionarily highly 

conserved genes encoding cytoplasmic serine/ threonine 
specific kinases, which function in mitogen signal 
transduction (reviewed in O.R. Rapp et al., The Oncogene 
Handbook, T. Curran et al., Eds. (Elsevier Science 

20 Publishers, The Netherlands, 1988), PP- 115-154; O.R. 
Rapp, oncogene 6, 495 (1991)). The three known active 
members in the raf family encode phosphoproteins of 
similar size (72/74 kD for Raf-1 ; 68 kD for A-Raf-1 , and 
74 kD for B-Raf (O.R. Rapp et al., in Retroviruses and 

25 u„ ma n Penology . R- Gallo et al., Eds. (Humana Press, 

Clifton, New Jersey 1985), PP- 449-472; T.W. Beck et al., 
gasj^c ***** Res. 15, 595 (1987); G. Sithanandam et al., 
neocene 5, 1775 (1990))). Raf-1 was first identified as 
the cellular homologue of v-raf (H.W. Jansen et al , 

30 Nature 307, 218 (1984)), the transforming gene of 3611 MSV 
ToXRapp et al., J, Vlstf,. «, 914 (1983); O.R. Rapp et 

n rrnr !W — * ^. osa so, 4218 (1933)). Amino 

acid comparisons of raf family genes shows three conserved 
regions [CR1, CR2, CR3] (T.W. Beck et al., Nncleic Acxds , 
35 Res. 15, 595 (1987)); CR1 is a regulatory region 

surrounding a Cys finger consensus seguence, CR2 is a 
serine/threonine rich region, and CR3 represents the 
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kinase domain. Raf-1 has been mapped to chromosome 3p25 
in humans (S.J. O'Brien et al., Science 223, 71 (1984)), 
and this region has been found to be frequently altered in 
small cell lung carcinoma (SCLC) (J. Whang-Peng et al., 
5 Cancer Genpf .. Cvtoaenet. 6, 119 (1982); J.M. Ibson et al., 
.7- cell. Biochem. 33, 267 (1987)), familial renal cell 
carcinoma (A.J. Cohen et al., N. Engl. J. Med. 301, 592 
(1979); G. Kovacs et al., Tnt. J. Cancer 40, 171 (1987)), 
mixed parotid gland tumors (J. Mark et al. , flereditas, 96, 
10 141 (1982)), and ovarian cancer (K. Tanaka et al., Cancer 
Genet. Cvtoaenet. 43. 1 (1989)). 

Raf genes are differentially expressed in 
various tissues (S.M. Storm et al., Oncogene 5, 345 

15 (1990)). c-raf-1 has been found to be expressed 

ubiquitously, though absolute levels vary between tissues. 
A -raf- 1 is present predominantly in the urogenital 
tissues, whereas B-Raf is most abundant in cerebrum and 
testis. The ubiquitous c-Ra£-1 kinase is regulated by 

20 tyrosine and serine phosphorylations that result from 

activated growth factor receptor kinases (D.K. Morrison et 
al., Cell 58, 648 (1989); D.K. Morrison et al., Proc. 
Ma n A^. sci. USA 85, 8855 (1989); K.S. Kovacina et 
al., »t. Biol. Chem. 265, 12115 (1990); P.J. Blackshear et 

25 al., J_; Biol- Chem. 265, 12131 (1990); M.P. Carroll et 

al., .7- Biol. Chem. 265, 19812 (1990); J.N. Siegel et al., 

Biol. Chem. 265, 18472 (1990); B.C. Turner et al., 
Prnr. Natl. a»*d. Sci. USA 88, 1227 (1991); M. Baccarini 
et al., EMBO J. 9, 3649 (1990); H. App et al., Mol. Cell. 

30 Biol. 1 1 , ■ 91 3 (1991 )) . Raf-1 operates downstream of Ras 
in mitogen signal transduction as judged by experiments 
using antibody microinjection (M.R. Smith et al., Nature 
320, 540 (1986)), c-raf-1 antisense expression constructs 
(W. Kolch et al., Nature 349, 426 (1991)), dominant 

35 negative mutants (W. Kolch et al., Nature 349, 426 

(1991)), and Raf revertant cells. Studies with NIH3T3 
cells and brain tissue demonstrated that mitogen treatment 
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induces Raf-1 kinase activity and causes a transitory 
relocation of the active enzyme from the cytoplasm to the 
nucleus and perinuclear area (2. Olah et al., Exp. Brain. 
Res . (in press); O.R. Rapp et al., in Cold .Spring Harbor 
5 SH EPSia ™ om -"^" ^oloav. Vol. LIII, Eds. (Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY 
1988) pp. 173-184). 

Raf-1 coupling has been examined in more than a 
10 dozen receptor systems and all strong mitogens stimulated 

Raf-1 kinase activity (O.R. Rapp, Qncoggne 6, 495 (1991); 

D~I Morrison et al., Cgii 58, 648 (1989); D.K. Morrison 

et al , rrn- ™* 85 ' 8855 (1989); K " S - 

Kovacina et al., J Mfll. SfegHL 265, 12115 (1990); P.J. 
15 Blackshear et al., J Biol. Chem.. 265, 12131 (1990); M.P. 

Carroll et al., J Biol, Chem. 265, 19812 (1990); J.N. 

Siegel et al.. J. Biol. Chem., 265, 18472 (1990); B.C. 

Turner et al. , rrnr H»H *™n. Sci. PSA 88, 1227 (1991); 

M. Baccarini et al., EMBO J. 9, 3649 (1990); H. App et 
20 al., Mo] Cell, Biol. 11, 913 (1991)), and this 

stimulation correlated with an increase in Raf-1 

phosphorylation leading to a shift in apparent molecular 

weight. 



25 



30 



35 



SUMMARY OF THE INVENTION 

It is an object of this invention to provide a 
method of identifying an individual at an increased risk 
for developing cancer. 

It is another object of this invention to 
provide a method for determining a prognosis in patients 
afflicted with cancer. 

It is a further object of this invention to 
provide a method for determining the proper course of 
treatment for a patient afflicted with cancer. 
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Further objects and advantages of the present 
- invention will be clear from the description that follows. 

In one embodiment, the present invention relates 
5 to a method of identifying an individual at an increased 
risk for developing cancer, comprising: 

amplifying a region of the c-raf-1 gene; 
analyzing products of the amplification for evidence 
of mutation; and 
10 classifying an individual having one or more 

mutations in the region as having an increased risk for 
developing cancer. 

In another embodiment, the present invention 
15 relates to a method for determining a prognosis in 
patients afflicted with cancer, comprising: 

amplifying a region of the c-raf-1 gene; 

analyzing products of the amplification for evidence 

of mutation; and 
20 classifying patients having no mutation in said 

region as being less likely to suffer disease relapse or 
having an increased chance of survival than those patients 
having one or more mutations in said region. 

25 In a further embodiment, the present invention 

relates to a method for determining the proper course of 
treatment for a patient afflicted with cancer, comprising: 
amplifying a region of the c-raf-1 gene; 
analyzing products of said amplification for evidence 

30 of mutation; 

identifying a patient having at least one mutation in 

said region, which patient may require treatment proper 

for patients having a lesser chance of survival or 

decreased time to relapse; and 
35 identifying a patient lacking mutations in said 

region, which patients may require treatment proper for 
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patients having a greater chance of survival or being less 
likely to suffer disease relapse. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 - Effect of BHT promotion on END 
tumorigenesis in NFS/n x AKR mice. The X-axis represents 
percent tumor induced mortality within each group, and the 
y-axis reflects age in weeks. All animals were exposed to 

10 END" transplacental^ at a dose of 0.5mM/Kg of mother's 
body weight on day 16 of gestation (presence of vaginal 
plug was scored as day one). At two weeks of age mzce 
were weaned into two separate groups and separated by sex. 
Both groups were housed in identical cages and supplied 

15 with food (Purina Lab Chow) and water ad libitum. 

Beginning at three weeks of age, group 2A (0) was given 
weekly intraperitoneal (i.p.) injection of com oil (0.1 
ml), and group 2B (o) received weekly i.p. injections of 
BHT (20 mg/Kg of body weight) dissolved in corn oil. 

20 Administration of BHT reduces the mean age of mortality 
from approximately 20 weeks to 13, and decreases the 
initial age of mortality. These curves are significantly 
different (ps 0.001) as judged by a 2-tailed Cox test. In 
both groups the rate of tumorigenesis was identical for 

25 males and females. 

Figure 2. Northern blot analysis of proto- 
oncogene expression levels in ENU induced tumors. 

Figure 3. Diagnostic digestion of PCR amplified 
Ki-ras genes from END induced tumors. Genomic DNA was 
isolated from a cesium chloride gradient during RNA 
preparations. In each case 10 ng was amplified via PCR 
(95°C, 5 min. followed by 35 cycles of 95°C, 1 mm. -» 
55'C, 1 min. -* 72% 1 min.) with 2 units of Tag I 
polymerase. The primers used (K1; 5'- 
AACTTGTGGTGGTTGGACCT— 3 ' -* (SEQ ID N0:6) and K2; <= 3'- 



30 



35 
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GTCTTAGTGAAACACCTACT-5 ' (SEQ ID NO:7)) generate a 79 b.p. 
product. The primer K1 ends at codon 12 and contains a 
mismatch from normal mouse (Ki-ras sequence its 18th 
nucleotide (G - C) creating a BstNI site (CCTGG) in 
5 conjunction with a normal codon 12 (GGT). Digestion of 
amplified product from a normal allele with BstNI produces 
two products of 19 and 60 b.p., whereas a mutation in one 
of the first two positions of codon 12 will eliminate the 
BstNI site. The presence of two normal alleles results in 
10 all of the product being cleaved and the presence of one 
mutant and one normal allele will result in only half of 
the product cut. In the three panels each sample was run 
in duplicate, uncut and cut with BstNI . F1 is DNA from an 
untreated NFS/n X AKR F1 mouse, and MCA5 is a murine cell 
15 line known to harbor a mutant K-ras codon 12 allele. One 
lymphoma (24Ly) and one cell line (117; derived from a 
lung adenocarcinoma) display a mutated Ki-ras codon 12 
allele; however, 24Ly was a passaged tumor and examination 
of the original tumor showed two normal alleles indicating 
that this mutation was acquired during passaging. 



20 



25 



Figure 4. c-raf-1 RNAse protection analysis of 
ENU induced tumors. The probe used was a 32P labeled 
antisense transcript from the 3' non-coding region of a 
mouse c-raf-1 cDNA to the 3' most Stul site. 
Hybridization of this probe with normal RNA results in a 
protected fragment of 1.2kb covering the region encoding 
the Raf-1 kinase domain. One ug of poly(A) + RNA from each 
tumor and 5 ug of F1 RNA (in order to get comparable 
30 signals) was hybridized for 12 hours at 52«C with 200,000 
cpm of C P labeled mouse c-£af antisense transcript. 
Hybrids were then digested for 30 minutes with 25 ug RNAse 
A and 33 units of RNAse T1 at room temperature. Digested 
hybrids were then incubated with 50 ug of proteinase K, 
35 phenol/chloroform extracted, ethanol precipitated, and 
resuspended in loading dye containing 80% formamide. 
Samples were then run on 6% polyacrylamide denaturing 
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sequencing gels at 65 watts. Gels were vacuum dried at 80 
degrees C and exposed to X-ray film. Probe is -digested 
probe alone; tRNA is probe hybridized to non-specxf re RNA; 
v-raf is probe hybridized to RNA from a v-raf transformed 
5 cell line and the bands detected represent single base 
mismatches between murine c-raf and v-«f; NFS/AKR F1 xs 
probe hybridized with RNA from a normal (untreated) mouse; 
24 LY is probe hybridized with RNA from a lymphoma; and 
the remaining lanes are probe hybridized with RNA xsolated 
10 from lung tumors- 

Figure 5. Schematic of Raf-1 protein showing 
sites of END induced mutations. CR1 , CR2, and CR3 
represent conserved regions 1, 2 and 3. cDNAs were made 

15 from tumor derived poly(A) + RNA using MoMuLV reverse 
transcriptase. Primers (HR1 sequence and MR2 sequence) 
encompassing a 435 base pair region c-raf were then used 
to amplify this region via PGR. The amplification mxxture 
W as then run on 1 .7% agarose gels and the 435 bp product 

20 isolated. This isolated fragment was then treated wxth T4 
polymerase and cloned into the Hindi sxte of Ml3m P 18 for 
sequencing. Another set of primers (EMR1 sequence and 
EMR2 sequence) was designed containing EcoRI sxtes at the 
termini and used to amplify a 609 base pair region 

25 (encompassing the original 435 base pair re ^° n) ' 

isolated products from these reactions were then dxgested 
with EcoRI and cloned into the EcoRI site of KS. 
Sequencing reactions were carried out using the Sequenase 
kit (OSB) according to the recommended protocols for 

30 single and double stranded sequencing. Sequencing 

reactions were run on 6% polyacrylamide denaturing gels at 
65 watts. Gels were vacuum dried at 80 degrees C and 
exposed to X-ray film. In each case a normal allele was 
also sequenced along with the mutant allele. 
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Figure 6. Schematic for Identifying c-raf-1 
mutations. Primers 1 and 2 are shown in SEQ ID NO: 8 and 
SEQ ID 1X0:9, respectively. 

5 DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to methods that 
involve amplifying a region of the c-raf-1 gene (the 
sequence of a mouse c-raf-1 gene is shown in SEQ ID NO: 10; 
10 the nucleotide and corresponding amino acid sequence of a 
human c-raf-1 gene is shown in SEQ ID NO: 11 and SEQ ID 
NO: 12, respectively). 

In one embodiment, the present invention relates 

15 to a method of identifying an individual at an increased 
risk for developing cancer (preferably, lung cancer, T- 
cell lymphomas, renal cell carcinoma, ovarian carcinoma, 
and mixed parotid gland tumors) comprising: amplifying a 
region (preferably by using the polymerase chain reaction 

20 method(PCR) -or by cloning techniques) of the c-raf-1 gene 
of the individual (SEQ ID N0:11)(in one preferred 
embodiment, the region encodes amino acids 514 to 535 of 
SEQ ID NO: 12); analyzing products of the amplification for 
evidence of mutation (preferably by DNA sequencing of the 

25 region) and classifying an individual having one or more 
mutations in the region as having an increased risk for 
developing cancer. In one prefered embodiment, the region 
encodes amino acids 500 to 550 of SEQ ID NO: 12 or amino 
acids 450 to 630 of SEQ ID NO: 12. In another prefered 

30 embodiment, the PCR method employs a primer comprising the 
sequence shown in SEQ ID NO: 7 and a primer comprising the 
sequence shown in SEQ ID NO: 8. In another prefered 
embodiment, the method comprises the steps shown in Figure 
6. 

35 m another embodiment, the present invention 

relates to a method for determining a prognosis in a 
patient afflicted with cancer (preferably, those cancers 
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listed above). The method comprises: amplifying the 
region of the c-raf-1 gene as described above; analyzing 
products of the amplification for evidence of mutation as 
described above; and classifying a patient having no 
5 mutation in the region as being less likely to suffer 
disease relapse or having an increased chance of survival 
than a patient having one or more mutations in the region. 

In another embodiment, the present invention 
10 relates to a method for determining the proper course of 
treatment for a patient afflicted with cancer (preferably, 
those cancers listed above), comprising: amplifying a 
region (described above) of the c-raf-1 gene as described 
above; analyzing products of the amplification for 
15 evidence of mutation as described above; identifying a 

patient having at least one mutation in the region, which 
patient may require treatment proper for patients having a 
lesser chance of survival or decreased time to relapse; 
and identifying a patient lacking mutations in the region, 
20 which patients may require treatment proper for patients 
having a greater chance of survival or being less likely 
to suffer disease relapse. 

Administration of therapeutic agents (cytotoxic 
25 or cytostatic) tailored to recognize the mutant Raf-1 
protein but not normal Raf-1 could specifically target 
tumor cells for death of growth inhibition. Such agents 
could be comprised of cytotoxic T-cells, antibodies, 
and/or specifically designed chemical compounds. 

30 

The following Examples demonstrate consistent 
point mutations of the c-raf-1 proto-oncogene , within a 
small region of the kinase domain, in a mouse model for 
chemical tumor induction. This is the first demonstration 
35 of point mutated raf genes in yiyo, and the first 

isolation of activating in vivo point mutations in the 
kinase domain of a proto-oncogene. The tumors examined 
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show a selective specificity for Raf-1 mutations as 
another family of genes, the ras proto-oncogenes which are 
frequently activated by point mutation in both animal and 
human tumors (S. Rodenhuis et al., Am. Rev, Respir. Pis, 
5 142, S27-30; T.R. Devereux et al., Carcinogenesis 12, 299 
(1991)), is not involved. 

The present invention is described in further 
detail in the following non-limiting examples. 

10 

EXAMPLES 

The following protocols and experimental details 
are referenced in the examples that follow: 

15 RNA Isolation. Tumors were excised, a small portion 
minced in PBS (phosphate buffered saline solution) for 
passaging in nude mice, frozen immediately in a dry 
ice/ethanol bath, and stored at -70° until RNA extraction. 
Frozen tissues were minced on wet ice in a guanidine 

20 thiocyanate buffer (4M guanidine thiocyanate 10mM EDTA, 2% 
N-lauryl sarcosine, 2% beta-mercaptoethanol, 10mM Tris 
(pH=7.6)), disrupted in a Dounce homogenizer, and 
extracted three times with phenol: chloroform: isoamyl 
alcohol (24:24:2). Supernatants were then transferred to 

25 SW41 tubes, 100 jig of cesium chloride per ml was added to 
the supernatant which was then underlayed with one half 
saturated cesium chloride in 10mM EDTA (pH=7.0; index of 
refraction 1.3995-1.4000), and centrifuged at 25,000 rpm 
for 20 hours in a Sorvall SW-41TI rotor using a Beckman 

30 model L5-50 ultracentrifuge. Supernatants were removed 
and RNA pellets dissolved in 4 ml resuspension buffer ( 1 0 
mM Tris-HCl pH=7.6, 5% beta-mercaptoethanol, 0.5% N-lauryl 
sarcosine, 10 mM EDTA), extracted once with 
phenol: chloroform: isoamyl alcohol, sodium acetate added to 

35 0.1 2M and RNA precipatated with two volumes ethanol at - 
20°C overnight. Precipitates were centrifuged at 9,000 rpm 
in a Sorvall SS-34 rotor for 30 minutes, and pellets 
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10 



redissolved in RNA sample buffer (10 mM Tris pH=7.4, M 
EDTA, 0.05% sodium dodecyl sulfate) and concentrations 
determined by absorbance at 260 nm. Poly (A)* RNA was 
isolated by binding to oligo dT cellulose columns in high 
salt CIMf Tris PH=7.4, ImM EDTA, 0.05% SDS, SOOmM Had), 
and eluting with RNA sample buffer heated to 40°C. 

northern Blottir^ 5 »g poly(A)* RNA per lane was ethanol 
precipitated, desiccated, resuspended in loading buffer 
(20mM HOPS P H=6.8, 5mM sodium acetate, ImM EDTA, 50% 
formamide, 6% formaldehyde), heated at 65°C for 5 min., 
quick chilled on wet ice for 10 min., and electrophoresed 
through a 0.7% agarose gel containing 2.2 M formaldehyde, 
20mM MOPS CpH=6.8], 5mM sodium acetate, and ImM EDTA. 
Gels were then blotted overnight onto nitrocellulose 
filters via capillary transfer in 20X SSC, filters were 
washed in 3X SSC for 10 min. and baked at 80»C for 2 
hours . 

^^, t j onS . Filters were prehybridized at 42'C in 5X 
SSC, 50% formamide, 20mM sodium phosphate P H=6.8, 200 
ug/ml PVP-40, 200 yg/1 ficoll 400, 200 ug/ml bovine serum 
albumin, and 200 ug/ml sonicated sheared salmon sperm DNA. 
Blots were . then hybridized with 500,000 cpm/ml of random 
25 primed *P labeled probes overnight at 42"C in 

prehybridization solution with 5% dextran sulfate. Blots 
were washed with agitation in 2X SSC, 0.1% SDS at room 
temperature six times for 20 minutes each wash, then 
washed once at 45"C in 0.1X SSC for 15 minutes. Fxlters 
30 were exposed to X-AR 5 film at -70 °C. 



20 



F.y AMPLE 1 
Tnmnr Induction 



35 



NFS female mice were mated with AKR males and 
pregnant females given a transplacental injection of 1- 
et hyl-1-nitrosourea (END) at a dosage of 0.5 mM/Kg 
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mother's body weight on day 16 of gestation, counting plug 
date as day one. ENU was chosen for tumor induction since 
it is a very potent direct acting carcinogen capable of 
modifying any base in vivo (Singer, B. et al., 1983. 
5 Molecular ttinloav of Mutage ns and Carcinogens. Plenum 
Press, New York). ENU alkylates all tissues with roughly 
the same efficiency (E. Scherer et al.. Cancer Lett. 46, 
21 (1989)) and has a very short half life in vivo (E.M. 
Faustman et al., Teratology 40, 199 (1989)) allowing 

10 specific mutagenesis of tissues which are mitotically 
active at a particular time. NFS and AKR were chosen as 
parental strains based on earlier studies which showed 
them to be particularly susceptible to ling tumors 
following ENU exposure (B.A. Diwan et al., Cancer Res. 34, 

15 764 (1974); S.L. Kauffman, JNCI 57, 821 (1976)). With 
this procedure nearly 100% of the offspring develop lung 
adenocarcinomas and approximately 70% develop, in 
addition, T-cell lymphomas with a mean latency of 
approximately 20 weeks. In order to achieve more rapid 

20 tumor development, weanling mice were treated with weekly 
injections of a tumor promoter, the antioxidant butylated 
hydroxy toluene or BHT (20mg/kg body weight dissolved in 
corn oil). BHT was used as it has been demonstrated to 
cause lung, lesions and hyperplasia when injected into mice 

25 (A.A. Marino et al., pt-qh. Soc. Kxp. Biol. Med. 140, 122 
(1972); H. Witschi et al., Proc. Soc Exp. Biol. Med, , 
147, 690 (1974); N. Ito et al., CRC Crit f Rev. Toxicol. 
15, 109 (1984)). In the present system it nearly doubles 
the rate at which tumors develop. Figure 1 compares tumor 
' 30 induced mortality with age of animals for those receiving 
ENU alone, and those receiving ENU and promoted with BHT. 
These curves demonstrate that when BHT is given the mean 
age of tumor induced mortality decreases from 
approximately 20 weeks to around 12, and there is also a 

35 decrease in initial latency. These curves are 

significantly different with a confidence limit greater 
than 99.99% using a 2-tailed Cox test. In addition, BHT 
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promotion, while increasing the rate at which tumors 
develop, does not affect the tumor spectrum. 



EXAMPLE 2 
nnr-ooepp Rmresslon 



Northern blot analysis revealed elevated levels 
of c-raf-1 , as compared to normal tissue, in every tumor 
examined (Figure 2), and Western blot analysis showed that 

10 protein levels correlated with message levels (O.R. Rapp 
efc al., in Onrog^s and Cancer, S.A. Aaronson et al., 
Eds (Tokyo/VNU Scientific Press, Tokyo, 1987) pp. 55-74). 
in addition, in cell lines derived from primary tumors, 
Raf-1 protein kinase activity was shown by immune-complex 

15 kinase assays to be constitutive. Further analysis of 
other oncogenes revealed no consistent pattern of 
expression except for ras and myc family genes. In the 
case of the inyc family, one member (either c-, N-, or L- 
mvc) was overexpressed but never more than one. For the 

20 ras genes, at least one member (Ki-, Ha-, or N-ras.), and 
o^en more than one, was expressed at high levels when 
compared with the normal tissue. In addition all 
oncogenes examined via Northern analysis exhibited full 
length, normal sized transcripts. 

ras genes were considered likely candidates for 
mutational activation since oncogenic forms of Ki-ras have 
previously been observed in lung tumors (S. Rodenhuis et 

al., - — — ^- 142 < S27 - 30 '' T ' R - D6VereUX Ct 

30 al., ^no^nesis 12, 299 (1991)) and ENU is a point 
mutagen (Singer, B. et al., 1983. Molecular Biology of 
Mutagens and ^^inocens, Plenum Press; New York). A 
systematic analysis of various ras codons known to be 
involved in oncogenic activation was therefore performed . 
35 Ha-, Ki-, and N— ras were examined at codons 12, 13, and 
61 for potential mutations via RNAse protection assays 
(R.M. Hyers et al., Sci^ 230, 1242 (1985); E. Winter et 



WO 93/06248 



PCI7US92/07817 



-1 5- 

al. f Prnr. Natl. . Sci ■ USA 82, 7575 (1985)), PCR 

amplification followed by subsequent sequencing (F. Sanger 
et al., -t MoT. Biol. 13, 373 (1975)), and PCR 
amplification followed by diagnostic restriction digests 
5 (W. Jiang et al., rinroaene 4, 923 (1989)). PCR 

amplification creating diagnostic enzyme sites is a very 
efficient way of examining alleles for mutations at known 
sites and involves designing a PCR primer whose 3' end 
lies next to and produces a novel restriction site 
10 encompassing the codon of interest. Following 

amplification, PCR products from normal alleles will 
contain the new restriction site, while mutant alleles 
will not. Digestion of the product from tissue with two 
normal alleles results in all product being cut; however, 
15 if one allele contains a mutation, only half of the 

product will be digested. Figure 3 shows the results of 
amplification and diagnostic digestion applied to Ki-ras 
codon 12 in several tumors and cell lines. The first 
panel is from a set of lymphomas. F1 is DNA from a normal 
20 untreated mouse and both alleles are cut by BstNl, 

indicating the presence of two normal alleles. MCA5 is a 
murine cell line known to contain a Ki-£as. codon 12 
mutation (L.F. Parada et al., M"1 ■ Cmll. Biol. 3, 2298 
(1983)), and only the amplified normal allele is cleaved. 
25 Of the five tumors shown in the second panel, one shows a 
mutant Ki-ra£ allele. The next panel shows some of the 
lung tumors tested and none of them exhibit a mutant 
allele, and the final panel shows tumor derived cell 
lines. The first three are from lymphomas and the last 
30 three from lung adenocarcinomas. One lung tumor line 
(#117) has a Ki-ras 12 mutation that was not present in 
the primary tumor but came up upon transplantation. This 
analysis has been performed with Ki, Ha and N-ras genes at 
codons 12 and 61 . Of all the tumors and cell lines 
35 examined by this method for mutations of the three ras 
genes at codons 12 and 61, the two shown here were the 
only ones detected. Examination of codon 13 was done by 
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35 



PGR amplification of genomic DNA surrounding codon 13 
followed by cloning into KS + (Stratagene) and double 
stranded sequencing. Table I summarizes the ras. mutation 
data. The most notable point from this table is the 
5 conspicuous lack of ras mutations in these tumors. In 
fact the number of ras. mutations is much lower than would 
be expected for a sampling of spontaneous tumors (S. 
Rodenhuis et al., *m BSS *»splr. 142 ' S27 " 30; T * R * 

Devereux et al., Carcinogenesis 12, 299 (1991); J.L. Bos, 
10 Cancer_Re^ 49, 4682 (1989)). Having eliminated ras genes 
as playing a primary role in the genesis of these ENU 
induced tumors, c-raf-1 was investigated for possible 
small or point mutations. 

15 TABLE I 

Tun**""" T.m eS PnsiHvft for ras Mutations 

rodon 12 Codon 13 Codon 61 





Tumors 


Cell 
Lines 


Tumors 


Cell 
Lines 


Tumors 


Cell 
Lines 


20 Ha-ras 
Ki/ras 
N-ras 


0/10 
T/10 
0/10 


0/6 
1/6. 
0/6 


0/6 
0/6 
0/6 


0/2 
0/2 
0/2 


0/10 
0/6 
0/10 


0/6 
0/2 
0/6 



• This was a second passage tumor in which the original 
25 tumor did not contain a Ki-ras mutation. 

Table I: Summary of mutation analysis for Ha-, Ki-, and 
N-ras at codons 12, 13, and 61. Each box displays the 
number of mutations detected, over the number of tumors 
30 and tumor derived cell lines examined via RNAse 

protection, sequencing or diagnostic digestion, for each 
of the nine codons. 



CTAMPLE 3 
Mutation* *n Raf-1 



Since no point mutations had been described 
for rai genes in vivo, as had been for the ras genes <E. 
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Santos et al., Science 223, 661 (1984); S. Rodenhuis, Nj. 
Rntrl. J. Med. 317, 929 (1987); M. Barbacid, Eur. J. 
run. Invest. 20, 225 (1990); F. Sanger et al., J . Mol. 
Biol, 12, 373 (1975)), point mutations were screened for 

5 using RNAse protection assays (R.M. Myers et al., 

Science 230, 1242 (1985); E. Winter et al., Proc. Natl. 
Anad. Sci. USA 82, 7575 (1985)). Figure 4 shows a 
typical protection assay using a c-£af.-1 probe. In this 
experiment the probe used covered the 3' end of raf-1 

10 from the 3' most StuI site to the end of the coding 

sequence. The first lane is a marker (pBR322 digested 
with Haelll), the second shows the probe alone 
undigested, the third lane shows the probe hybridized to 
unrelated RNA in this case tRNA, the fourth lane shows 

15 hybridization with v-raf transformed cells and the lower 
bands represent cleavage at points where the mouse c- 
raf- 1 gene differs from v-raf- The fifth lane shows 
hybridization with RNA isolated from a normal lung of an 
untreated F1 mouse, the next lanes are RNA isolated from 

20 several tumors. In the case of the normal RNA, only 

one, fully protected, band is detected while in the case 
of the tumors two major bands are seen after digestion. 
20 out of 20 tumors analyzed in this fashion showed this 
extra band. These data demonstrate the following major 

25 points: 1) there is a tumor specific alteration in c- 
raf-1 that results in a region of non-homology 
recognizable by either RNAse A or T1 ; 2) The alterations 
are confined to the same region of one allele as two 
bands of equal size are present in the tumor lanes, and; 

30 3) both alleles were expressed at comparable levels as 
both bands are of approximately equal intensity. In the 
assay shown 5 ug of poly (A) + RNA was hybridized from 
normal tissue, and 1 ug was used from the tumors. This 
was necessary to get signals that could be compared on 

35 the same gel due to the overexpression of c-raf-1 in the 
tumors. By running these assays with various, markers it 
was possible to estimate the approximate site of the 



WO 93/06248 



PCT/US92/07817 



-18- 



10 



15 



20 



25 



30 



35 



alteration^, to be in the vicinity of the exon 14 exon 
1S junction. In order to define the precise genetic 
elteration or alteretions, PCR primers were 
which would generate e 600 bp fragment encompassrng thxs 
region. cDHAs from tumor derived SNA were then 
amplified and cloned into KS* (Stratagene, for double 
stranded sequencing. The sequencing results from 
several tumors are shown in Figure 5. The top portron 
of Figure 5 presents a cartoon of the mouse Raf-1 
Protein! There are three conserved regions CR1. CR2 and 
£3 with CR3 representing the Kinase domain The probe 
used in the RHase protection assays covers the indicated 
area, and the PCR primers amplified the 
region. Sequencing through this area revealed a variety 
of mutations Just do^stream of the *PE sit.^ These 
mut ants are shown in an expanded versio 
of Figure 5 (See also SEQ ID K0:1 for normal mouse 
sequence and SEQ ID H0:2. 3, 4, and 5 for mutant 
sequences). These mutants were isolated from four 
separate tumors, and in each case a normal allele (SEQ 
ID »:1> was also sequenced. Repeating the cDH& 
synthesis, PCR amplification, cloning and sequencing 
J,es the same sequence and normal tissue shows no 
Nations demonstrating that these alterations are not 
factual, sequence covering the amplif ed region has 
„.en examined and it is interesting that all of these 
changes occur within a very small region of the raf 

in fact the region where these <"?« 
overlaps an epitope shared by monoclonal anybodies 
generated against raf (w. Kolch et al., apneas!* 5, 713 
Imotranrcomputer modeling of the protein shows thrs 

1 be a hydrophilic domain, the structure of * oh is 
predicted to be altered by these mutations. This 
indicates a biologically important region for th e 
molecule and indeed the first of these mutation tested 
Z ^ll cell assays, after cloning into a retroviral 
expression vector (E1-neo, iO. Heidecker et al., KsA, 
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Cell- Biol. 10, 2503 (1990))), was found to be weakly 
transforming when driven by a Moloney LTR. The 
transformation efficiency was comparable to EC2, a 
previously characterized mutation of human c-raf-1 cDNA 
5 (G. Heidecker et al., Mol. Cell. Biol. 10, 2503 (1990); 
C. Wasylyk et al., Mol. Cell. Biol. 9, 2247 (1989)) and 
-20 fold lower than the v-raf oncogene. 



★ * * * * 
10 All publications mentioned hereinabove are 

hereby incorporated in their entirety by reference. 

While the foregoing invention has been 
described in some detail for purposes of clarity and 
15 understanding, it will be appreciated by one skilled in 
the art from a reading of this disclosure that various 
changes in form and detail can be made without departing 
from the true scope of the invention and appended 
claims . 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Rapp, Ulf R. 

v Storm, Stephen H. 

(ii) TITLE OF INVENTION: DETECTION METHOD FOR C-RAF-1 GENES 
(iii) NUMBER OF SEQUENCES: 12 

(1V) "ffffS^'W^ ROSENBERG, P.C. u 

(5) StSeCT: Suite 400, 133 Carnegie Way, N.W. 

(C) CITY: Atlanta 

(D) STATE: Georgia 

(E) COUNTRY: USA 

(F) ZIP: 30303-1031 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(6) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Perryman, David G. 

(B) REGISTRATION NUMBER: 33,438 

(C) REFERENCE/DOCKET NUMBER: 1414.0421 

fix) TELECOMMUNICATION INFORMATION: 
11 (A) TELEPHONE: (404) 688-0770 
(B) TELEFAX: (404) 688-9880 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
1 5 
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Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 30 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 45 

Thr Asp Ser Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 55 60 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 75 80 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 90 95 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys Ala Arg Leu 
100 105 110 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
r 115 120 125 

Asp Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 
130 135 140 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 155 160 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 
165 170 175 

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn He Arg Gin 
180 185 190 

Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 
195 200 205 

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 
210 215 • 220 

Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 240 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 
245 250 255 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 
260 265 270 

Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
275 280 285 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
2go 295 300 
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Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 iLS 

Gly Ser Gly Thr Gin Gin Lys Asn Lys lie Arg Pro Arg Gly Gin Arg 
325 

Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 
340 345 J3U 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 

355 360 
His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 

370 375 3 

Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 3 9° 395 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
405 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 

Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 

m**r 440 *»HD 

Gly Met Asp Tyr Leu His 
450 455 460 



435 440 
Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 



He His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 
465 470 *' a 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 



485 



Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 

500 * ,yi> 
Ala Pro Glu Val lie Arg Met Gin Asp Asp Asn Pro Phe Ser Phe Gin 

515 520 
Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Ala Gly 

530 535 
Glu Leu Pro Tyr Ala His lie Asn Asn Arg Asp Gin He lie Phe Met 
545 550 M3 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 



565 

580 



Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
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Lys Glu Glu Arg Pro Leu Phe Pro Gin He Uu Ser Ser lie Glu Leu 
595 600 605 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 620 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 " 630 635 640 

Thr Ser Pro Arg Leu Pro Val Phe 
645 



(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
1 5 10 15 

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr lie 
20 25 30 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 45 

Thr Asp Ser Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 55 60 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 • * 70 75 80 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 90 95 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys AU Arg Leu 
100 105 HO 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
115 120 125 

Asp Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 
130 135 WO 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 155 160 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 
165 170 175 
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Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn lie Arg Gin 
180 185 

Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 
195 200 tU3 

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 
210 215 " u 

Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 
245 250 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 
260 z " 

Asp Ser Arg Met He Glu Asp Ala lie Arg Ser His Ser Glu Ser Ala 
275 cov 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
290 295 -* uu 

Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 

Gly Ser Gly Thr Gin Gin Lys Asn Lys lie Arg Pro Arg Gly Gin Arg 
325 

Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 

340 345 
Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 

355 360 
His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 

Ala 

385 390 395 



370 " 375 
Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 



Arg His Val Asn lie Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 



405 



Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 



420 

Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
450 455 
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Ser Asn Asn He Phe Leu His Glu Gly Leu 
475 480 

Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
490 495 

Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
505 510 

Met Gin Asp Asp Asn Pro Phe Ser Phe Gin 
520 525 

Gly He Val Leu Tyr Glu Leu Met Ala Gly 
535 540 

He Asn Asn Arg Asp Gin He He Phe Met 
555 560 

Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 
570 575 

Arg Leu Val Ala Asp Cys Val Lys Lys Val 
585 590 

Phe Pro Gin He Leu Ser Ser He Glu Leu 
600 605 

Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
615 620 

Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
635 640 



(2) INFORMATION FOR SEQ ID NO: 3: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
1 5 10 15 

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 30 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 45 



He 
465 


His 


Arg 


Asp Met Lys 
470 


Thr 


Val 




He Gly Asp 
485 


Ser 


Gly 


Ser 


Gin Gin Val 
500 


Ala 


Pro 


bill 

515 


Val Val Arn 

vai vai f\rg 


Ser 


Asp 
530 


Val 


Tyr Ser Tyr 


Glu 
545 


Leu 


Pro 


Tyr Ala His 
550 


Val 


Gly 


Arg 


Gly Tyr Ala 
565 


Cys 


Pro 


Lys 


Ala Met Lys 
580 


Lys 


Glu 


Glu 
595 


Arg Pro Leu 


Leu 


Gin 
610 


His 


Ser Leu Pro 


Leu 
625 


His 


Arg 


Ala Ala His 
630 


Thr 


Ser 


Pro 


Arg Leu Pro 
645 
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Thr Asp Ser Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 

50 55 
Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 ' 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 " 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys Ala Arg Leu 
100 105 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
115 120 

Asp Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 

130 135 
Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 133 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 

165 A 
Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn lie Arg Gin 

Thr Val Gly Asp Ser Gly 
195 ZOO 205 



180 l fl5 
Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 



Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 

210 215 
Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 " 3 

Asn Thr Ser Ser Pg Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 

260 265 
Asp ser Arg Het He Glu Asp Ala He Arg Ser His Ser Glu Ser AU 

ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leo Ser Pro Thr Gly 
290 295 

Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
310 313 



305 

Gly Ser Gly Thr Gin Gin Lys Asn Lys lie Arg Pro Arg Gly Gin Arg 
325 
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Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 
340 345 350 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 360 365 

His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 
370 375 380 

Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 395 400 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
405 410 415 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 430 

Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 445 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
450 455 460 

He His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 
465 470 475 480 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 495 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
500 505 510 

Ala Pro Glu Val He Arg Met Gin Asp Asn Asn Pro Phe Ser Phe Gin 
515 520 525 

Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Ala Gly 
530 535 540 

Glu Leu Pro Tyr Ala His He Asn Asn Arg Asp Gin He He Phe Met 
545 550 555 560 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 
565 570 575 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 590 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
3 595 600 605 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 620 
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Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 



625 



630 



Thr Ser Pro Arg Leu Pro Val Phe 
645 



(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 648 amino acids 
B) TYPE: amino acid 



Si 



TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
I 5 

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 



20 



Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp top Gly Lys Leu 



35 



Thr Asp Ser Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 

50 55 
Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 75 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys Ala Arg Leu 
100 105 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 

115 1ZU 
Asp Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 



130 



Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 lDD 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 
165 

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn He Arg Gin 



180 

195 



Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 
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Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 
210 215 220 

Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 240 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 
245 250 255 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 
260 265 270 

Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
K 275 280 285 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
290 295 300 

TrD Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 315 320 

Gly Ser Gly Thr Gin Gin Lys Asn Lys He Arg Pro Arg Gly Gin Arg 
325 330 335 

Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 
340 345 350 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 360 365 

His Gly Asp Val Ala Val Lys He Leu Lys. Val Val Asp Pro Thr Pro 
370 .375 380 

Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 395 400 

Arc His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
" 405 410 415 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 430 

Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 445 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
450 455 460 

lie His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 
465 470 475 480 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 495 
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Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
500 505 5 

Ala Pro Glu Val He Arg Met Gin Asp Asp Asn Pro Phe Ser Ser Gin 
515 520 3" 

Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Ala Gly 
530 535 

Glu Leu Pro Tyr Ala His He Asn Asn Arg Asp Gin He He Phe Met 
545 550 55b 3 ° w 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 
565 5/Q 3/3 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
595 600 ° u: > 

Leu Gin Hfs Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 

610 6 15 620 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 630 0J3 

Thr Ser Pro Arg Leu Pro Val Phe 
645 



(2) INFORMATION FOR SEQ ID N0:5: 

ft) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 *° 

Thr Asp Ser Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 55 ou 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
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Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 90 95 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys Ala Arg Leu 
100 105 HO 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
115 120 125 

Asp Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 
130 135 HO 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 155 160 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys" 
165 170 175 

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn He Arg Gin 
3 180 185 190 

Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 
195 200 205 

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 
210 215 220 

Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 240 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 
245 250 255 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 
260 265 270 

Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
275 280 285 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
290 295 300 

Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 315 320 

Gly Ser Gly Thr Gin Gin Lys Asn Lys He Arg Pro Arg Gly Gin Arg 

330 wws> 



325 

340 



Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 
~ *~ 345 3ou 



Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 360 365 
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Hls Gty Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 
370 375 380 

Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 3 95 400 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
3 405 410 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 



420 



425 



Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 445 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
450 455 460 

He His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 



465 



470 



475 



Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 



485 



490 



Ser Gly Ser Gin G1n Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
500 505 510 

Ala Pro Glu Val He Arg Met Gin Asp Asp Asn Pro Phe Ser Phe Gin 

5Z5 



515 



520 



Ser Thr Cys Thr Phe Tyr Gly He Val Leu Tyr Glu Leu Met Ala Gly 

535 540 



530 



Glu Leu Pro Tyr Ala His He Asn Asn Arg Asp Gin He He Phe Met 
545 550 555 560 



Val 



Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 
— - 570 



565 



575 



Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
- — 585 



580 



590 



Lys Glu Glu Arg Pro Leu Phe Pro Gin He lieu Ser Ser He Glu Leu 



595 



600 



Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 6Z0 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 



625 



630 



640 



Thr Ser Pro Arg Leu Pro Val Phe 
645 
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(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

AACTTGTGGT GGTTGGACCT 20 

(2) INFORMATION FOR SEQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 
TCATCCACAA AGTGATTCTG 20 

(2) INFORMATION FOR SEQ ID N0:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: . 
AGGAGACCAA GTTTCAGATG 20 

(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 
GCGTGCAAGC ATTGATATCC 20 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1947 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANOEDNESS; double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:10: 



ATGGAGCACA 


TACAGGGAGC 


TTnrAAfArc ATPACPAATC 

TTGuAAuALb A I wAb lmm i « 


GCTTTGGACT 


CAAAGATGCG 


60 


GTGTTTGATG 


GCTCCAGCTG 


~. a -r-/»*rr»f»p/*T AOPATTfiTTP 
CATCTCCCC 1 ALLA 1 1 b 1 1 L. 


AGPAGTTTGG 


CTATCAGCGC 


120 


CGGGCCTCAG 


ATGATGGCAA 


os**^/*- a r+r*f+ »t TPTT/^TAACA 

6CTCAC66AT 1 t 11 L i AAbA 


f AAGCAATAC 


TATCCGGGTT 


180 


TTCTTGCCGA 


ATAAGCAAAG 


6ACTGTBGIL AAlblbLbbH 


ATGGAATGAG 


CTTACATGAC 


240 


TGCCTTATGA 


AAGCTCTGAA 


f*fs-r/% *n a f»o/* f*TPPACPPAC 

GGTGAGAGGC ClGCAfaLLAfa 


AGTGCTGTGC 


AGTGTTCAGA 


300 


CTTCTCCAGG 


AACACAAAGG 


•^•» PCPTTACATT 

TAAGAAAGCA Lb L I t AbA i i 


GGAACACCGA 


TGCCGCCTCT 


360 


CTGATTGGAG 


AAGAACTGCA 


AGTGGATTTT UbbAlwAib 


TTCCCATCAC 


AACTCACAAC 


420 


TTTGCTCGGA 


AAACGTTCCT 


s% a a n/»TT/*/'A TTOTCTCAPA 

GAAGCTTGCA 1 1 C 1 b I bALA 


TCTGTCAGAA 


GTTCCTGCTA 


480 


AATGGATTTC 


GATGTCAGAC 


rr/>T/*^rTlt f* A AC 1 1 1 P ATG 

TTGTGGCTAC AAb i 1 i la i b 


AGCACTGTAG 


CACCAAAGTA 


540 


CCTACTATGT 


GTGTGGACTG 


r«f»TA ATJITr ACAPAISPTPT 

GA6TAATA 1 L AbALAbL 1 L 1 


TGCTGTTTCC 

1 WW 1 W III V*w 


AAATTCCACT 


600 


GTTGGTGACA 


GTGGAGTCCC 


«/*/»Af»/*A/*OT TPT TTPPPA A 
AGCACCACL I 1 L 1 1 1 lllah 


TGCGTCGGAT 

1 ywu i wuur « » 


GCGAGAATCT 


660 


GTTTCCCGGA 


T6CCTGCTAG 


•v-rr+f»f*n/*r' Af* ACATAPTPTA 

TTCCCAGCAC AbA IAL ilia 


PACCCCATGC 


CTTCACTTTC 


720 


AACACCTCCA 


GCCCTTCCTC 


a/* a »/»/»TT/»r f*Tf*TPPPACA 

AGAAGGTTCC ClLlLLLAbA 


GGCAGAGGTC 


AACGTCCACT 


780 


CCCAATGTCC 


ACATGGTCAG 


/« a a rrrTf* PATCTCGAPA 
CACCACCC I u LA 1 b 1 bbALA 


GCAGGATGAT 


TGAGGATGCA 


840 


ATTCGAAGTC 


ACAGTGAATC 


■ ^r>rTPArrT TPACPPPTGT 

AGCCTCACCT 1 LAbLLL i b i 


CCAGCAGCCC 


AAACAACCTG 


900 


GGTCCAACAG 


GCTGGTCACA 


****** niii OCPPTCPPAC 

GCCCAAAACC Cttb I bLLAb 


PAPAAAGAGA 


GCGGGCACCA 


960 


GGATCTGGGA 


CCCAGCAAAA 


iiimiii ATT APCPPTPCTC 

AAACAAAATT AbbLLlLbib 


GGCAGAGAGA 


CTCGAGTTAT 


1020 


TACTGGGAAA 


TAGAAGCCAG 


TGAGGTGATG CTGTCTACIl. 


CGATPGGGTP 
QUHl wwww 1 w 


AGGTTCCTTT 

nuu i i ww iii 


1080 


GGCACTGTGT 


ACAAGGGCAA 


GTGGCATGGA 6ATGTT6CAG 


TAAAGATCCT 


AAAGGTGGTT 


1140 


GACCCAACTC 


CAGAGCAACT 


TCAGGCCTTC AGGAACGAGG 


TGGCTGTTTT 


GCGCAAAACA 


1200 


CGGCATGTTA 


ACATCCTGCT 


GTTCATGGGG TACATGACAA 


AGGACAACCT 


GGCGATTGTG 


1260 


ACTCAGTGGT 


GTGAAGGCAG 


CAGTCTCTAC AAACACCTGC 


AT6TCCAGGA 


GACCAAATTC 


1320 


CAGATGTTCC 


AGCTAATTGA 


CATTGCCC6A CAGACA6CTC 


AGGGAATGGA 


CTATTTGCAT 


1380 


GCAAAGAACA 


TCATCCACAG 


AGACATGAAA TCCAACAATA 


TATTTCTCCA 


TGAAGGCCTC 


1440 


ACGGTGAAAA 


TTGGAGATTT 


TGGTTTGGCA ACAGTGAAGT 


CACGCTGGAG 


T6GTTCTCAG 


1500 


CAGGTTGAAC 


AGCCCACTGG 


CTCTGTGCTG TGGATGGCCC 


CAGAAGTAAT 


CCGGATGCAG 


1560 
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GAT6ACAACC C6TTCAGCTT CCAGTCCGAC GTGTACTCGT ACGGCATCGT GCTGTACGAG 1620 

CTGATGGCTG GGGAGCTTCC CTACGCCCAC ATCAACAACC GAGACCAGAT CATCTTCATG 1680 

GTAGGCCGTG GGTATGCATC CCCTGATCTC AGCAGGCTCT ACAAGAACTG CCCCAAGGCA 1740 

ATGAAGAGGT TGGTGGCTGA CTGTGTGAAG AAAGTCAAAG AAGAGAGACC TTTGTTTCCC 1800 

CAGATCCTGT CTTCCATC6A GCTGCTTCAG CACTCTCTGC CGAAAATCAA CAGGAGCGCC 1860 

TCTGAGCCTT CCCTGCATCG GGCAGCTCAC ACTGAGGACA TCAATGCTTG CACGCTGACT 1920 

ACATCCCCAA GGCTACCAGT CTTCTAG . 1947 

(2) INFORMATION FOR SEQ ID N0:11: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1947 base pairs 

B) TYPE: nucleic acid 

C) STRANDEDNESS: double 

D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1944 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 11: 

ATG GAG CAC ATA CAG GGA GCT TGG AAG ACG ATC AGC AAT GGT TTT GGA 48 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
1 5 10 15 

TTC AAA GAT GCC GTG TTT GAT GGC TCC AGC TGC ATC TCT CCT ACA ATA 96 

Phe Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 30 

GTT CAG CAG TTT GGC TAT CAG CGC CGG GCA TCA GAT GAT GGC AAA CTC 144 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 45 

ACA GAT CCT TCT AAG ACA AGC AAC ACT ATC CGT GTT TTC TTG CCG AAC 192 

Thr Asp Pro Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 55 60 

AAG CAA AGA ACA GTG GTC AAT GTG CGA AAT GGA ATG AGC TTG CAT GAC 240 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 75 80 
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TGC CTT AT6 
Cys Leu Met 

GCA GTG TTC 
Ala Val Phe 



AAA GCA CTC AAG 

Lys Ala Leu Lys 
85 

AGA CTT CTC CAC 
Leu His 



Arg Leu 
100 



GAT TGG AAT ACT GAT 

Asp Trp Asn Thr Asp 
115 

GAT TTC CTG GAT CAT 

Asp Phe Leu Asp His 
130 

ACG TTC CTG AAG CTT 
Lys Leu 



GCT GCG 
Ala Ala 
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GTG AGG GGC CTG 

Val Arg Gly Leu 
90 

GAA CAC AAA GGT 

Glu His Lys Gly 
105 

TCT TTG ATT GGA 
He Gly 



Ser Leu 
120 



Thr Phe Leu 
145 

AAT GGA TTT 
Asn Gly Phe 

AGC ACC AAA 
Ser Thr Lys 



GTT CCC CTC ACA 

Val Pro Leu Thr 
135 

GCC TTC TGT GAC 
Cys Asp 



ACA CAC 
Thr His 



CAA CCA GAG TGC TGT 288 

Gin Pro Glu Cys Cys 
95 

AAA AAA GCA CGC TTA 336 

Lys Lys Ala Arg Leu 
110 

GAA GAA CTT CAA GTA 384 

Glu Glu Leu Gin Val 
125 

AAC TTT GCT CGG AAG 432 

Asn Phe Ala Arg Lys 
140 



Ala Phe 
150 



ATC TGT CAG AAA TTC CTG CTC 480 
Gin Lys 



CGA TGT CAG ACT 

Arg Cys Gin Thr 
165 

GTA CCT ACT ATG 
Thr Met 



TGT GGC 
Cys Gly 



He Cys 
155 



TAC AAA TTT CAT 

Tyr Lys Phe His 
170 



Val Pro 
180 



TGT GTG GAC TGG AGT AAC 

Cys Val Asp Trp Ser Asn 
185 



CTC TTA TTG TTT CCA 

Leu Leu Leu Phe Pro 
195 

CTA CCT TCT TTG ACT 

Leu Pro Ser Leu Thr 
210 

CCT GTT AGT TCT CAG 
Ser Gin 



AAT TCC 
Asn Ser 



ACT ATT GGT GAT AGT GGA 
Gly Asp 



Thr He 
200 



Ser Gly 
205 



Phe Leu Leu 
160 

GAG CAC TGT 528 

Glu His Cys 
175 

ATC AGA CAA 576 

He Arg Gin 
190 

GTC CCA GCA 624 
Val Pro Ala 



ATG CGT CGT ATG 

Met Arg Arg Met 
215 



CGA GAG TCT GTT TCC AGG ATG 672 

Arg Glu Ser Val Ser Arg Met 
220 



Pro Val Ser 
225 

AAC ACC TCC 
Asn Thr Ser 



CAC AGA TAT TCT ACA CCT CAC GCC TTC ACC TTT 720 

His Ala 



His Arg 
230 



Tyr Ser Thr Pro 
235 



Phe Thr Phe 
240 



AGT CCC TCA TCT 

Ser Pro Ser Ser 
245 



GAA GGT TCC CTC TCC CAG AGG CAG AGG 768 
Ser Gin 



Glu Gly Ser Leu 
250 



Arg Gin Arg 
255 



TCG ACA TCC ACA CCT AAT GTC CAC ATG GTC AGC ACC ACG CTG CCT GTG 816 
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Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu Pro Val 
260 265 270 

GAC AGC AGG ATG ATT GAG GAT GCA ATT CGA AGT CAC AGC GAA TCA GCC 864 

Asp Ser Arg Met He Glu Asp Ala lie Arg Ser His Ser Glu Ser Ala 
V 275 280 285 

TCA CCT TCA GCC CTG TCC AGT AGC CCC AAC AAT CTG AGC CCA ACA GGC 912 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
290 295 300 

TGG TCA CAG CCG AAA ACC CCC GTG CCA GCA CAA AGA GAG CGG GCA CCA 960 

Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 315 320 

GTA TCT GGG ACC CAG GAG AAA AAC AAA ATT AGG CCT CGT GGA CAG AGA 1008 

Val Ser Gly Thr Gin Glu Lys Asn Lys He Arg Pro Arg Gly Gin Arg 
325 330 335 

GAT TCA AGC TAT TAT TGG GAA ATA GAA GCC AGT GAA GTG ATG CTG TCC 1056 

Asp Ser Ser Tyr Tyr Trp Glu lie Glu Ala Ser Glu Val Met Leu Ser 
340 345 350 

ACT CGG ATT GGG TCA GGC TCT TTT GGA ACT GTT TAT AAG GGT AAA TGG 1104 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 360 365 

CAC GGA GAT GTT GCA GTA AAG ATC CTA AAG GTT GTC GAC CCA ACC CCA 1152 

His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 
370 375 380 

GAG CAA TTC CAG GCC TTC AGG AAT GAG GTG GCT GTT CTG CGC AAA ACA 1200 

Glu Gin Phe Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 395 400 

CGG CAT GTG AAC ATT CTG CTT TTC ATG GGG TAC ATG ACA AAG GAC AAC 1248 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
405 410 415 

CTG GCA ATT GTG ACC CAG TGG TGC GAG GGC AGC AGC CTC TAC AAA CAC 1296 

Leu Ala lie Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 430 

CTG CAT GTC CAG GAG ACC AAG TTT CAG ATG TTC CAG CTA ATT GAC ATT 1344 
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Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 w 

GCC CGG CAG ACG GCT CAG GGA ATG GAC TAT TTG CAT GCA AAG AAC ATC 1392 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
450 455 <tou 

ATC CAT AGA GAC ATG AAA TCC AAC AAT ATA TTT CTC CAT GAA GGC TTA 1440 
lie His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 

ACA GTG AAA ATT GGA GAT TTT GGT TTG GCA ACA GTA AAG TCA CGC TGG 1488 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 

AGT GGT TCT CAG CAG GTT GAA CAA CCT ACT GGC TCT GTC CTC TGG ATG 1536 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 

500 505 
GCC CCA GAG GTG ATC CGA ATG CAG GAT AAC AAC CCA TTC AGT TTC CAG 1584 

Ala Pro Glu Val He Arg Met Gin Asp Asn Asn Pro Phe Ser Phe Gin 
515 520 5« 

TCG GAT GTC TAC TCC TAT GGC ATC GTA TTG TAT GAA CTG ATG ACG GGG 1632 

Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Thr Gly 
530 535 MU 

GAG CTT CCT TAT TCT CAC ATC AAC AAC CGA GAT CAG ATC ATC TTC ATG 1680 

Glu Leu Pro Tyr Ser His He Asn Asn Arg Asp Gin He He Phe Met 
545 550 s " 

GTG GGC CGA GGA TAT GCC TCC CCA GAT CTT AGT AAG CTA TAT AAG AAC 1728 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Lys Leu Tyr Lys Asn 
565 3/u 

TGC CCC AAA GCA ATG AAG AGG CTG GTA GCT GAC TGT GTG AAG AAA GTA 1776 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 

580 585 
AAG GAA GAG AGG CCT CTT TTT CCC CAG ATC CTG TCT TCC ATT GAG CTG 1824 
Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 

CTC CAA CAC TCT CTA CCG AAG ATC AAC CGG AGC GCT TCC GAG CCA TCC 1872 
Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
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610 615 620 

TTG CAT CGG GCA GCC CAC ACT GAG GAT ATC AAT GCT TGC ACG CTG ACC 1920 

Leu His Arg Ala Ala His Thr Glu Asp lie Asn Ala Cys Thr Leu Thr 
625 630 635 640 

ACG TCC CCG AGG CTG CCT GTC TTC TAG 1947 

Thr Ser Pro Arg Leu Pro Val Phe 
645 



(2) INFORMATION FOR SEQ ID NO: 12: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xij SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
15 10 15 

Phe Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 30 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 45 

Thr Asp Pro Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 * 55 60 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 75 80 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 90 95 

Ala Val Phe Arg Leu Leu His Glu His Lys Gly Lys Lys Ala Arg Leu 
100 105 HO 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
115 120 125 

Asp Phe Leu Asp His Val Pro Leu Thr Thr His Asn Phe Ala Arg Lys 
130 135 140 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 155 160 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 
165 170 175 
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Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn lie Arg Gin 
180 185 



Leu Leu Leu Phe Pro Asn Ser Thr He Gly Asp Ser Gly Val Pro Ala 
jg5 200 £>UJ 

Leu Pro Ser Leu Thr Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 

210 215 
Pro Val Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 
245 25° 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu Pro Val 
260 265 

Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
275 280 " J 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 

290 295 
Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 

Val Ser Gly Thr Gin Glu Lys Asn Lys He Arg Pro Arg Gly Gin Arg 
325 330 

Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 
340 345 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 • 360 ■"=» 

His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 

370 375 
Glu Gin Phe Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
' 405 410 

Leu Ala lie Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
Aon 425 « w 



Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 «*» 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
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Ile His Arg Asp Het Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 
465 470 475 480 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 495 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser VaT Leu Trp Met 
500 505 510 

Ala Pro Glu Val lie Arg Met Gin Asp Asn Asn Pro Phe Ser Phe Gin 
515 520 525 

Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Thr Gly 
530 " 535 540 

Glu Leu Pro Tyr Ser His He Asn Asn Arg Asp Gin He He Phe Met 
545 * 550 555 560 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Lys Leu Tyr Lys Asn 
565 570 575 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 590 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
595 600 60S 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 620 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 630 635 640 

Thr Ser Pro Arg Leu Pro Val Phe 
645 
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WHAT IS rT.ATMED IS : 

1 . A method of identifying an individual at an 
increased risk for developing cancer, comprising: 

amplifying a region of the c-raf-1 gene of said 
individual; 

analyzing products of said amplification for 
evidence of mutation; and 

classifying an individual having one or more 
mutations in said region as having an increased risk for 
developing cancer. 

2 The method according to claim 1 , wherein said 
region encodes at least amino acids 514 to 535 of SEQ ID 
N0:12. 

3. The method according to claim 2, wherein said 
region encodes at least amino acids 500 to 550 of SEQ ID 
110:12. 

4. The method according to claim 3, wherein said 
region encodes at least amino acids 450 to 630 of SEQ ID 
N0:12. 

5. The method according to claim 1, wherein said 
products are analyzed by DNA sequencing. 

6. The method according to claim 1, wherein said 
amplification is effected using a polymerase chain 
reaction (PCR). 

7. The method according to claim 6, wherein said 
PCR employs a primer comprising SEQ ID. NO:7 and a primer 
comprising SEQ ID NO: 8. 

8. A method for determining a prognosis in 
patients afflicted with cancer, comprising: 
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amplifying a region of the c-raf-1 gene of said 
individual; 

analyzing products of said amplification for 
evidence of mutation; and 

classifying a patient having no mutation in said 
region as being less likely to suffer disease relapse or 
having an increased chance of survival than a patient 
having one or more mutations in said region. 

9. The method according to claim 8, wherein said 
region encodes at least amino acids 514 to 535 of SEQ ID 
N0:12. 

10. The method according to claim 9, wherein said 
region encodes at least amino acids 500 to 550 of SEQ ID 
NO: 12. 

11. The method according to claim 10, wherein said 
region encodes at least amino acids 450 to 630 of SEQ ID 
N0:12. 

12. The method according to claim 9, wherein said 
products are analyzed by DNA sequencing. 

13. The method according to claim 9, wherein said 
amplification is effected using polymerase chain 
reaction (PCR). 

14. The method according to claim 13, wherein said 
PCR employs a primer comprising SEQ ID NO: 7 and a primer 
comprising SEQ ID NO: 8. 

15. A method for determining the proper course of 
treatment for a patient afflicted with cancer, 
comprising: 

amplifying a region of the c-raf-1 gene of said 
patient; 
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analyzing products of said amplification for 
evidence of mutation; 

identifying a patient having at least one mutation 
in said region, which patient may require treatment 
proper for a patient having a lesser chance of survival 
or decreased time to relapse; and 

identifying a patient lacking mutations in said 
region, which patient may require treatment proper for a 
patient having a greater chance of survival or being 
less likely to suffer disease relapse. 

16- The method according to claim 15, wherein said 
region encodes at least amino acids 514 to 535 of SEQ ID 
N0:12. 

17. The method according to claim 16, wherein said 
region encodes at least amino acids 500 to 550 of SEQ ID 
N0:12. 

18. The method according to claim 17, wherein said 
region encodes at least amino acids 450 to 630 of SEQ ID 
NO:12. 

19. The method according to claim 16, wherein said 
products are analyzed by DNA sequencing. 

20. The method according to claim 16, wherein said 
amplification is effected using a polymerase chain 
reaction (PCR). 

21- The method according to claim 20, wherein said 
PCR employs a primer comprising SEQ ID N0:7 and a primer 
comprising SEQ ID NO: 8. 
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Figure 4 
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Figure 6 
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Either Genomic DNA or cONA 

Nucleotide 1307 of coding Nucleotide 1326 of coding 

Primer 1 = 5'- AGGAGACCAAGTTTCAGATG -3 1 

Nucleotide 1915 of coding Nucleotide 1896 of coding 

I I 

Primer 2 = 5'- GCGTGCAAGCATTGATATCC -3' 
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